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Reef-building corals comprise multipartite symbioses where the cnidarian animal is host 
to an array of eukaryotic and prokaryotic organisms, and the viruses that infect them. 
These viruses are critical elements of the coral holobiont, serving not only as agents of 
mortality, but also as potential vectors for lateral gene flow, and as elements encoding 
a variety of auxiliary metabolic functions. Consequently, understanding the functioning 
and health of the coral holobiont requires detailed knowledge of the associated viral 
assemblage and its function. Currently, the most tractable way of uncovering viral diversity 
and function is through metagenomic approaches, which is inherently difficult in corals 
because of the complex holobiont community, an extracellular mucus layer that all corals 
secrete, and the variety of sizes and structures of nucleic acids found in viruses. Here 
we present the first protocol for isolating, purifying and amplifying viral nucleic acids from 
corals based on mechanical disruption of cells. This method produces at least 50% higher 
yields of viral nucleic acids, has very low levels of cellular sequence contamination and 
captures wider viral diversity than previously used chemical-based extraction methods. 
We demonstrate that our mechanical-based method profiles a greater diversity of DNA 
and RNA genomes, including virus groups such as Retro-transcribing and ssRNA viruses, 
which are absent from metagenomes generated via chemical-based methods. In addition, 
we briefly present (and make publically available) the first paired DNA and RNA viral 
metagenomes from the coral Acropora tenuis. 



Keywords: viral metagenomics, coral, holobiont, chloroform, virus diversity 



INTRODUCTION 

Marine viruses are the most abundant biological agents in the 
world's oceans, with quantities exceeding bacterial abundances by 
at least one order of magnitude (Wommack and Colwell, 2000; 
Suttle, 2005, 2007), and can infect members of all domains of life 
(Rohwer et al., 2009). Viruses play fundamental roles in the evo- 
lution and population dynamics of their hosts and are key players 
in oceanic biogeochemical cycling (Suttle, 2005). Despite more 
than two decades of research into viruses in the marine environ- 
ment, comparatively little research has been conducted on viruses 
associated with coral reefs. The term "coral holobiont" describes 
the entirety of the community comprised by a coral colony 
and includes the coral animal, as well as microscopic organisms 
including symbiotic dinoflageUates and endolithic algae, bacte- 
ria, fungi, archaea, and viruses (Rohwer et al., 2002). Over the 
past decade a range of methods have been employed to study 
viruses associated with coral reefs and the coral holobiont, includ- 
ing transmission electron microscopy (TEM) (Wilson et al, 2001, 
2005a; Davy et al, 2006; Davy and Patten, 2007; Lohr et al, 2007; 
Patten et al., 2008a), flow cytometry (Seymour et al., 2005; Patten 
et al, 2006, 2008b; Lohr et al, 2007), and metagenomics (Angly 
et al, 2006; Dinsdale et al, 2008; Marhaver et al, 2008; Thurber 
et al., 2008; Hewson et al, 2012). WhUe these studies have 



revealed a diverse array of viruses, coral virology is in its infancy 
and inherent methodological challenges still exist in this research 
field. Currently, there are no established coral cell lines and only 
two bacteriophages have been isolated and characterized; they 
infect the coral-associated bacterial pathogens. Vibrio coralliilyti- 
cus (Efrony et al, 2007) and Thallasomonas loyana (Efrony et al., 
2009). 

Viral metagenomics is defined here as the study of viral 
genetic material contained within an environmental sample. 
Metagenomics is a relatively new and promising tool for char- 
acterizing coral-associated viral communities but it also faces 
certain pitfalls and limitations that can influence data interpre- 
tation. Caveats include the limited number of viral sequences 
currently in public databases, biases introduced during the isola- 
tion of viral metagenomes from coral tissues and the complexity 
of the multi-compartmental nature of the coral holobiont. 

The main experimental objective of this study was to develop 
a method that best captures the diversity of both DNA and RNA 
viruses associated with the coral holobiont as assessed by metage- 
nomics. We compared our new method with existing methods 
for isolating viral metagenomes from corals (Marhaver et al., 
2008; Thurber et al, 2008, 2009; Hewson et al, 2012). These 
methods notably include a chloroform addition step, prior to 
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virus purification, and we demonstrate this can lead to exclu- 
sion of certain virus families in the final data. We also show that 
the amplification methods used can influence the data arising 
from sequencing viral metagenomes in coral tissue. Our method 
reduces potential biases for certain viral groups and captures 
a larger diversity of the viral community in coral tissues com- 
pared to other methods (Marhaver et al., 2008; Thurber et al., 
2008, 2009; Hewson et al, 2012). We initially validated this 
method for isolating viral DNA from the coral, Pocillopora dami- 
cornis (PociUoporidae), and subsequently for isolating viral DNA 
and RNA from a member of the Acroporidae {Acropora tenuis). 
Both families are ecologically important in Indo-Pacific coral reef 
systems (Veron, 1993). 

MATERIALS AND METHODS 

SAMPLING LOCATIONS AND COLLECTION OF CORAL TISSUE 

Field sampling occurred at Trunk Reef (18°20'49"S, 146°49'46"E) 
in November 2012 and in Pioneer Bay off Orpheus Island 
(18°38'3"S, 146°29'57"E) in March 2013, in the central Great 
Barrier Reef At Trunk Reef, approximately 45 g of coral tissue 
was sampled from three healthy, freshly collected coral colonies 
of Pocillopora damicornis. Approximately 20 g of Acropora tenuis 
tissue was sampled from three healthy, freshly collected coral 
colonies collected in Pioneer Bay. Fragments were washed in auto- 
claved, 0.02 (jim filtered virus-free seawater. Subsequently, tissue 
was blasted from the coral skeleton, using an air-gun, into 15 mL 
0.02 |im filtered (Anotop, Whatman) SM buffer (100 mM NaCl, 
8 mM MgS04, 50 mM Tris pH 7.5) in a zip-lock bag. 

CHLOROFORM EXTRACTION AND CESIUM CHLORIDE DENSITY 
GRADIENT CENTRIFUGATION 

For the P. damicornis samples, isolation of the viral metagenomes 
associated with the coral tissue was undertaken in a 3 -way com- 
parison of methodologies. The first approach was to replicate pre- 
viously published protocols for isolating viruses from coral tissue 
(Marhaver et al, 2008; Thurber et al, 2008, 2009) using a chlo- 
roform disruption step, which we term the chloroform (CFM) 
method. Briefly, 5 mL of chloroform per 40 mL of coral blas- 
tate was added and samples were agitated gently for 1 h at room 
temperature. Coral blastates were homogenized at 5000 rpm for 
1 min (Heidolph SilentCrusher'^"). Samples were immediately 
centrifuged at 1000 g for 15 min. The supernatant was transferred 
to sterile glass corex tubes and spun at 12,000 g for 15 min to pellet 
the majority of microbial cells (Beckman Coulter JA 25.50 rotor). 
A cesium chloride (CsCl) density gradient was then formed by 
layering 1 mL of 1.7, 1.5, and 1.35 g mL^^ CsCl into 13.2 mL 
UltraClear™ ultracentrifuge tubes (Beckman Coulter) with 9 mL 
sample layered on the top of the gradient. Gradients were then 
centrifuged for 2 h at 60,000 g at 4° C in a swinging bucket rotor. 

NEW MECHANICAL-BASED METHOD OMITTING THE USE OF 
CHLOROFORM 

A number of viruses are sensitive to chloroform as it acts to 
remove the lipid envelope surrounding the exterior of the viral 
capsid (Feldman and Wang, 1961; Ackermann, 2006). We devel- 
oped an alternative approach to avoid the use of chloroform, 
instead using mechanical disruption to break open host cells. 



which we term the mechanical (MECH) method. In addition, 
the MECH protocol was used to test the effects of storage in liq- 
uid nitrogen on samples post-tissue homogenization. Sampling 
of coral colonies in the field frequently means working in remote 
locations with limited access to resources specific to virus purifi- 
cation, such as an ultracentrifuge. Therefore, preservation of 
fresh coral tissue homogenate using liquid nitrogen is often nec- 
essary until further processing in a laboratory can occur. We 
tested the effect of storage in liquid nitrogen (LN2) prior to 
nucleic-acid isolation vs. the immediate processing of fresh coral 
tissue homogenate (no liquid nitrogen, NLN). A simplified out- 
line of the optimized method using MECH for generating viral 
metagenomes from coral tissue is shown in Figure 1. To standard- 
ize the sample for method testing, we pooled tissue homogenate 
from three P. damicornis colonies and subdivided the pooled sam- 
ple for processing by MECH (NLN and LN2) and the published 
CFM method for viral metagenome isolation, purification and 
amplification. 

In the MECH method, the coral tissue blastate was homog- 
enized at 10,000 rpm for 1 min and spun at 400 g for 5 min. 
The supernatant was then aliquoted into 1.5 mL aliquots in 2 mL 
eppendorf tubes containing 0.3 mL acid- washed glass beads (425- 
600 (xm diameter) (Sigma-Aldrich). The tubes were placed in 
a bead beater and cells were disrupted at 5000 rpm for 5 min. 
Tubes were centrifuged at top speed in a bench-top Eppendorf 
centrifuge for 1 min and the supernatant was collected for viral 
fractionation using step CsCl density gradients. To confirm that 
the MECH method was not disrupting virus particles, two dsDNA 
viruses, OtV-2 (Weynberg et al., 201 1) and EhV-86 (Wilson et al, 
2005b), were subjected to the same mechanical disruption pro- 
tocol. Flow cytometry was used to enumerate viruses before and 
after disruption. We could find no discernible differences in virus 
populations following MECH (data not shown). 

CsCl solutions were made with solid molecular-grade 
CsCl (Sigma-Aldrich) dissolved in 0.02 |xm filtered (Anotop, 
Whatman) SM buffer. A 3mL cushion of 1.6 g mL^^ CsCl was 
added to the bottom of a 13.2 mL UltraClear™ ultracentrifuge 
tube (Beckman Coulter). Next, 2.5 mL of 1.45 g mL^'and 1.3 g 
mL^' densities and 2 mL of 1.2 g mL^' were sequentially layered 
in that order on top of the 1 .6 g mL^ ' layer. The density of sample 
homogenate supernatant was adjusted to 1.12 g mL^' with CsCl 
and 2 mL of sample was placed on top of the layered gradient. 
Gradients were then centrifuged in an Optima XL-80K ultracen- 
trifuge (Beckman Coulter) in a swinging bucket rotor (SW 41 
Ti, Beckman Coulter) for 2.5 h at 40,000 rpm and 4°C. Fractions 
(0.5 mL) from the gradients were collected in 1.5 mL tubes using 
an 18 bore gauge needle and luer-lok syringe, puncturing the 
tube ~ 1 mL from the bottom. 

The density of fractions was determined gravimetricaUy and 
DNA concentration of each fraction was measured using a Quant- 
It Picogreen dsDNA High Sensitivity assay kit (Invitrogen, Life 
Technologies). Diafiltration and buffer exchange were performed 
to remove CsCl salts, as their presence may interfere with down- 
stream processing, such as DNA extraction. Fractions contain- 
ing the nucleic acid peaks were pooled and buffer exchange 
was performed with Amicon® centrifugal spin columns (30kDa, 
Millipore) against 0.02 (jim filtered SM buffer. The diafiltrated 
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Air-blast tissue from coral skeleton (3 x biological replicates) into 0.02 |jm SM buffer (Weigh coral fragment before and after air-blasting) 



Homogenise tissue 



Freeze in Liquid Nitrogen (LN2) No Liquid Nitrogen preservation (NLN) 

Bead-beat supernatant at 5,000 rpm for 5 min 

1 

Spin bead-beaten samples at 13,000 x g for 1 min. 
Collect supernatant 



Cesium chloride (CsCI) step density gradient 
ultracentrifiigation 



Density and nucleic acid quantification of CsCI 
gradient fractions 



Diafiltratlon (buffier exchange) and 0.2 pm Durapore® 
filtration of pooled fractions 



Method: Thurber ef at. 2009 (CFM) 



Nuclease treatment and nucleic acid (DNA +/or RNA extraction) 



RP-SISPA amplification 



It/lultiple Displacement Amplification 



Sequencing and bioinfbrmatics analysis 
FIGURE 1 I Flowchart outlining the methods for generating viral metagenomes from coral tissue trialed in this study. 



sample was then filtered using a 0.2 (Am pore size Durapore® 
syringe filter to remove remaining contaminating bacteria. 

NUCLEIC ACID EXTRACTION AND AMPLIFICATION FOR SEQUENCING 

All samples were treated with DNase and RNase (Ambion) 
prior to nucleic acid extraction. DNA was extracted and RNase 
treated using a MasterPure kit (Epicentre, lUumina) follow- 
ing manufacturer's instructions. RNA was extracted using a 
Qiagen QIAamp viral RNA kit (Qiagen) following manufac- 
turer's instructions, including the final DNase step (Ambion). 
Each amplification method is outlined in detail below. The ampli- 
fication methods used were Phi 29 polymerase-based Multiple 
Displacement Amplification (MDA) RepliG® (QIAGEN) tech- 
nology and a modified Random Priming-mediated Sequence- 
Independent Single-Primer Amplification (RP-SISPA) approach. 
Samples processed via the CFM method were only amplified 
using RepliG®, as this is similar to other Phi 29 polymerase-based 
amplification techniques used in this method (e.g., GenomiPhi®) 



(Thurber et al., 2008; Hewson et al, 2012). The NLN and LN2 
samples were amplified using both RepliG® and a RP-SISPA 
method for DNA viral metagenomes modified from a published 
protocol for amplifying RNA viruses extracted from seawater 
samples (CuUey et al., 2010). The modified method converts viral 
DNA to dsDNA through a two-step Klenow reaction, which also 
adds the primer sites to both DNA strands prior to amplification 
by PGR. 

AMPLIFICATION OF VIRAL DNA GENOMES WITH RepliG® 

In order to reduce some of the inherent biases in multi- 
displacement amplification (MDA), such as a preference for 
ssDNA viral genomes, DNA extractions were converted to dsDNA 
prior to amplification. Triplicate 10 |xL aliquots of the DNA 
extractions, containing ds and ssDNA viral genomes, underwent 
a single round of Klenow reaction (3'-5' exo-, 5U/|iL) by mix- 
ing 1.5 |iL of lOx reaction buffer (New England Biolabs Buffer 
2), 1.5 |iL of dNTPs (2.5 mM stock), 1 |iL of random hexamer 
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primers (SOng/jjiL, Invitrogen). The reaction was incubated at 
94° C for 3 min, then placed on ice for 3 min to allow for primer 
annealing before adding 1 [lL of Klenow (3'-5' exo-) and incu- 
bated at 25°C for 10 min, then 37°C for 60 min, with a termina- 
tion step of 75°C for 20 min. After termination, reactions were 
pooled and cleaned using a Qiagen QIAamp DNA mini kit and 
eluted in 50 |xL of Buffer AE. Replicate MDA reactions (n = 3 for 
each sample) were amplified using 2.5 [iL dsDNA template and 
the Qiagen RepliG® kit using the standard protocol. All reactions 
were run on a 0.8% agarose gel in 1 x TAE at 100 V for 30 min to 
confirm amplification, pooled and cleaned with QIAampl DNA 
minikit and eluted in 200 (jlL of Buffer AE. Negative controls were 
treated the same and also sent for sequencing to confirm that no 
viral contamination was present. 

AMPLIFICATION OF VIRAL DNA GENOMES WITH RP-SISPA 

As with the RepliG® protocol, Klenow Fragment (3'-5' exo-) was 
used to convert all DNA genomes to dsDNA using RP-SISPA 
primers with a 3' random hexamer sequence that is used for 
downstream PGR amplification. To label the first strand with the 
RP-SISPA primer, 5 |xL of nucleic acid was added to 9 [iL reac- 
tion mix containing 1.5 |iL of 10 x PGR buffer (New England 
Biolabs Buffer 2); 1.0 (iL of 2.5 mM dNTPs; 1.5 jiL of primer 
FR26RV-N (GGGGGAGGTGTGGAGATATGNNNNNN, 10 [aM 
stock) and 5 [lL of DNase-free distilled water. The reaction was 
incubated at 94° G for 3 min, then placed on ice for 3 min to allow 
for primer annealing before adding 1 [lL of Klenow Fragment 
(3'-5' exo-, 5U/|i,L, NEB #) and incubated at 37°G for 60 min. A 
second round of Klenow Fragment reaction (3'-5' exo) labeled 
the second strand with the SISPA primer, by adding an additional 
1 [iL of primer and 1 |xL dNTP, prior to another 94°G for 3 min 
heating step, then ice for 3 min before a final addition of 1 |xL 
of Klenow Fragment (3'-5' exo-). The reaction was incubated at 
37° G for 60 min then terminated at 75° G for 20 min. 

AMPLIFICATION OF VIRAL RNA GENOMES WITH RP-SISPA 

Amplification of viral RNA genomes with RP-SISPA was per- 
formed as described by GuUey et al. (2010) in Manual 
of Aquatic Viral Ecology (MAVE). Briefly, in preparation 
for cDNA synthesis, lOjxL purified RNA viral template was 
mked with 1 |iL of 2.5 mM dNTPs and 1.3 (iL of FR26RV-N 
(GGGGGAGGTGTGGAGATATGNNNNNN, IOm-M stock) and 
FR40RV-T primer (GGGGGAGGTGTGGAGATATG(T)20, 50 nM 
stock). The reaction was heated to 65°G for 5 min then cooled on 
ice for 3 min to allow the primers to anneal. While still on ice, 1 |xL 
DTT (Invitrogen) was added to the reaction as an enzyme sta- 
bilization reagent with 1 [iL RNase OUT (Invitrogen) to protect 
the sample from RNAse activity. The complementary DNA strand 
was synthesized with 200 U of Superscript III reverse transcrip- 
tase. The reaction was incubated initially at 25°G for 10 min to 
allow annealing of the hexamer 3' end of primer FR26RV-N and 
the poly(T)20 3' end of primer FR40RV-T to the template while 
cDNA synthesis commenced. The temperature was then increased 
to 50° G for 60 min. The first strand synthesis reaction was heated 
immediately to 94° G for 3 min and then rapidly cooled on ice. 
A complementary second strand was subsequently synthesized 
at 37°G for 60 min with the addition of 1 |iL Klenow Fragment 



(3'-5' exo-, 5U/|xL). The Klenow reaction was terminated with a 
final incubation at 75° G for 20 min. 

PGR amplification of the SISPA primer labeled template (DNA 
and RNA) was done in triplicate 25 |xL reactions containing 
2.5 (jlL lOx reaction buffer, 4(xL dNTPs (2.5 mM stock), 2 jxL 
FR20RV primer (GGGGGAGGTGTGGAGATATG, 10 (jlM stock), 
1 (xL of template, and 0.25 jxL of TaKaRa LA HS Taq polymerase 
(5U/|xL, Scientifix). The reaction was incubated at 95°G for 
10 min, followed by 30 cycles of denaturation at 95°G for 30 s, 
60°G for 60s, 72°G for 90s, and a final extension step at 72°G 
for 13 min to allow the completion of complementary strand syn- 
thesis. The PGR reactions were loaded on to a 0.8% agarose gel 
in IxTAE at 100 V for 30 min. If amplification resulted in visi- 
ble PGR products (typically a smear; products should be longer 
than 250 bp), a reconditioning PGR was performed on pooled 
reactions as follows. One reconditioning PGR contained 10 [lL of 
pooled SISPA reaction template, lOii-L lOx buffer, 16 (xL dNTP 
(2.5 mM stock), 8 [xL FR20RV primer (10 [xM stock) and 0.75 |xL 
TaKaRa LA HS Taq. The reaction was incubated at 95° G for 
10 min, followed by 5 cycles of denaturation at 95°G for 30 s, 60°G 
for 60 s, 72°G for 90 s, followed by extension at 72°G for 13 min. 
Reactions were cleaned and QG was assessed as above. 

SEQUENCING AND BIOINFORMATICS ANALYSIS 

After amplification, samples were cleaned with a QIAamp® DNA 
Mini kit (RepliG® amplification) or a MinElute® PGR purifi- 
cation kit (RP-SISPA). Samples were checked for quantification 
using a Quant-iT PicoGreen® kit on a NanoDrop 3300 flu- 
orospecrometer, for quality (260:280 ratios) on a NanoDrop 
2000, and were run on a 0.8% agarose gel in Ix TAE at 100 V 
for 30 min to confirm a size range appropriate for sequencing 
(~250-500bp) was present without contamination of smaller 
fragments. All metagenomes were sequenced using Nextera XT 
MiSeq 250 bp paired-end sequencing (lUumina) at the Ramaciotti 
Gentre, University of New South Wales, Sydney, Australia. 

Raw sequence reads were processed in GLG Genomics 
Workbench 5.5. Sequences were imported as lUumina paired-end 
reads, adaptor sequences were trimmed, and reads were checked 
for quality using a PHRED score of 20 and a minimum length 
of 100 bp. Paired reads were merged and a final data set con- 
taining merged reads and ORFans was checked again for QG 
with a minimum length of 200 bp. To carry out the taxonomic 
assignment, these non-assembled read data sets were uploaded 
to the Metavir web server, which is dedicated to the analysis 
of viral metagenomes (http://metavir-meb.univ-bpclermont.fr) 
(Roux et al., 2011). This server computes the taxonomic com- 
position using tBLASTx against the NGBI viral Refseq genomes 
(release 2013-09-12, bit-score = 50) and normalizes the results 
to viral genome length using the GAAS tool (Angly et al., 2009). 
All virus sequences were further classified into families using the 
taxonomic information from the top BLAST hit. Tetranucleotide 
clustering and rarefaction curves were generated using tools avail- 
able through Metavir. The P. damicornis metagenomes arose from 
the same biological samples and, therefore, tetranucleotide clus- 
tering was used to detect relative changes in sequence diversity 
caused by different methodologies. For the rarefaction curves, 
sequences were clustered at 75% identity because these data sets 
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FIGURE 2 I Density and viral DNA concentrations for each fraction 
collected from step CsCI gradients of the virus fraction isolated from 
Pocillopora damicornis coral tissue using the (A) Chloroform (CFM) 
method without Liquid Nitrogen preservation of tissue (B) No Liquid 
Nitrogen (NLN) preservation of tissue and the MECH method (C) 
Liquid Nitrogen (LN2) preservation of tissue and the MECH method. 
Density fractions that were pooled and used for metagenomic sequencing 
are highlighted in blue. 



originated from the same pooled tissue homogenate sample from 
P. damicornis. 

NUCLEOTIDE SEQUENCE ACCESSION NUMBERS 

The five datasets generated from the P. damicornis samples 
were submitted to Genbank Sequence Read Archive 
(SRA) and are available under the accession numbers 
SRR1207981, SRR1207983, SRR1207980, SRR1207984, and 
SRR1246941 (Supplementary Table 1). The two datasets gener- 
ated from the A. tenuis samples have also been deposited in the 
SRA under the accession numbers SRR1207979 and SRR1210582. 

RESULTS 

VIRAL DNA METAGENOME PURIFICATION BY CESIUM CHLORIDE 
DENSITY STEP GRADIENT CENTRIFUGATION 

AH samples were purified using cesium chloride (CsCl) den- 
sity gradient centrifugation, which is a standard technique for 
purifying both pure virus isolates and natural virus assemblages 
(Lawrence and Steward, 2010). The CFM method (Marhaver 
et al, 2008; Thurber et al, 2008, 2009) describes purifying 
coral-associated viral metagenomes using a CsCl density gra- 
dient centrifugation derived from standard protocols for pure 
cultures of bacteriophage lambda (X) under laboratory condi- 
tions (Sambrook et al., 1989). More recently, alternative CsCl step 
gradient protocols have become available that are better suited 
for the purification of diverse viral assemblages from environ- 
mental samples. The MECH method presented here uses a CsCl 
step gradient centrifugation method modified from Lawrence and 
Steward (2010). For all approaches, the CsCl densities and DNA 
quantities were measured for each sequential fraction removed 
along the gradient, from most to least dense (Figure 2). There 
was a difference in the density curve profiles generated for the 
CFM and MECH methods, following CsCl step-gradient cen- 
trifugation (Figure 2). The CFM method showed an initial steep 
decline in density in the denser half of the gradient, reaching a 
plateau of approximately 1.05 g mL^' throughout the remaining 
gradient (Figure 2 A). The MECH method resulted in a lin- 
ear decline in density from 1.6 g mL^^ to a final 1.17 g mL^' 
(Figures 2B,C). 

Nucleic-acid quantification by Pico Green assay showed the 
CFM method had an even distribution of nucleic acids (frac- 
tions yielded between 120 and 220 ng DNA) throughout 
the gradient, while the MECH method resulted in a bimodal 
distribution with peaks at ~1.4g mL^' and ~1.2g mL^^ for 
both LN2 (80-320 ng (jur^ DNA) and NLN treatments (~200- 
400 ng DNA) (Figures 2B,C). The published CFM method 
(Marhaver et al, 2008; Thurber et al, 2008, 2009) did not include 
gravimetric measurements of density of the collected gradient 
fractions, but stated that a 1.5 mL fraction should be removed 
from just below the original 1.5 g mL^^ step marked on the 
outside of the tube during layering (Thurber et al., 2009). As 
a literal interpretation of the protocol, we measured the den- 
sity of each fraction in the CFM CsCl step gradient and pooled 
the recommended density range of 1.35-1.5 g mL^^ (fractions 
1, 2) (Figure 2A) before proceeding to viral nucleic-acid extrac- 
tion and sequencing. It has been reported that the 1.35-1.5 g 
mL^^ fraction contains bacteriophages (King et al, 2011), but 



this fraction alone will not recover a wide diversity of the viruses 
present (Breitbart et al., 2002). Nucleic acid quantification of 
each fraction assists in deciding which fractions to target and 
pool for viral nucleic-acid extraction and sequencing as was done 
for both NLN and LN2 samples. For example, we pooled frac- 
tions 5-8 and 14-17 (Figure 2B) for the NLN samples, which 
corresponds to buoyant densities of bacteriophages and viruses 
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that infect eukaryotes, respectively (King et al., 2011). Although 
the pattern of CsCl step gradient profiles does not change, we 
found the DNA yield was 25-50% lower as a consequence of 
LN2 treatment of coral tissues compared to the NLN treatment 
(Figure 2C). 

In the MECH method, following removal and pooling of 
the indicated fractions from each CsCl step gradient, diafiltra- 
tion was performed to remove CsCl salts. Diafiltration, involving 
several buffer exchange steps with 0.02 [im filtered SM buffer 
to remove CsCl salts following density gradient centrifugation, 
has not previously been reported for coral viral metagenome 
isolation, although it is used in other aquatic virus isolation 
techniques. Following diafiltration, the pooled fractions were 
0.2 (jim Durapore® filtered to remove any residual cellular mate- 
rial. Although this may exclude some large viruses, the Durapore® 
membrane type allows smaller viruses to pass through and 
ensures residual cellular contamination is removed, a problem 
best dealt with prior to sequencing (Roux et al., 2013). Analysis 
of the metagenomic data sets using a HMM (Hidden Markov 
Model)-based detection tool revealed that 16S rDNA sequences 
(proxy for detection of cellular DNA contamination in viral 
metagenomes) (Roux et al., 2013) were less than 2 in 10,000 
sequences (Supplementary Table 1). 

CORAL-ASSOCIATED VIRAL METAGENOMIC DIVERSITY 

Viral DNA was extracted from P. damicornis colonies (n = 3), 
pooled and prepared for sequencing according to the CFM or 
MECH protocol. The nucleic acids in the viral fraction purified 
from coral tissues are typically not in high enough concentra- 
tion for sequencing, and therefore need to be amplified prior 
to sequencing. We tested a combination of different ampli- 
fication methods, as described, and compared the resulting 
metagenomic data. Information on the reads generated from 
the lUumina MiSeq sequencing is shown in Supplementary 
Table 1. Following sequencing, the proportion of sequences 
that matched to known viral sequences in the Genbank non- 
redundant database using tBLASTx (bit-score = 50) is shown 
in Table 1 and Figure 3. RP-SISPA-amplified samples yielded a 
higher total proportion of dsDNA viral-like sequence hits com- 
pared to amplification using RepliG® (Table 1). The largest pro- 
portion of sequence similarities in the dsDNA virus group was 
to bacteriophages (93.5-98.6% for RepliG®-amplified samples 
and ~99% for RP-SISPA-ampHfied samples) (Figure 3; Table 1). 
Amplification of samples using RepliG® resulted in a high pro- 
portion of hits to ssDNA viruses (Table 1), which is typical for 
MDA-amplified viral DNA from aquatic environments (Kim and 
Bae, 2011). However, relative to the CFM method, the RepliG®- 
amplified MECH samples had a higher proportion of sequences 
with similarity to dsDNA eukaryotic viruses. Notably, the CFM 
metagenomes exhibited the lowest diversity, with no identified 
representatives for retro-transcribed (RT) and ssRNA viruses 
(Figure 3; Table 1). For both amplification techniques, the LN2 
metagenomes had the highest proportion of hits to RT viruses 
(Table 1; Figure 3). Overall, ssRNA and unclassified archaeal 
viruses were absent from most samples, with only a very low per- 
centage of hits seen in the NLN and LN2 MECH metagenomes 
(Table 1). 



As an alternative way of assessing viral assemblage similari- 
ties and differences, we conducted tetranucleotide clustering on 
the viral sequences identified from the metagenomes (Figure 4A). 
Since the metagenomes originated from the same pooled coral tis- 
sue samples and therefore should have a similar oligonucleotide 
signature, we used tetranucleotides as a more sensitive approach 
than dinucleotides, as demonstrated previously (Pride et al., 
2003; Teehng et al., 2004). This analysis shows that both the 
method of cell disruption during virus isolation, as well as the 
nucleic-acid amplification method prior to sequencing, affects 
the viral metagenome composition. The NLN and LN2 MECH 
metagenomes clustered based on RepliG® or RP-SISPA amplifi- 
cation, but metagenomes from the CFM method (also amplified 
with RepliG®) had a tetranucleotide signal that was distinct from 
all other data sets (Figure 4A). To estimate the fraction of virus 
types sequenced and compare viral sequence diversity per unit 
sampling effort, we generated rarefaction curves (Figure 4B). 
Rarefaction curves plot the sequence clusters randomly sampled 
from a metagenomic data set as a function of the total number 
of sequence clusters sampled. The rarefaction curves sequences 
clustered at 75% similarity (Roux et al., 2011) showed that the 
CFM method had lower sequence diversity overall (Figure 4B), 
especially when compared to the MECH RepliG® amplified sam- 
ples (Figure 3). The MECH samples amplified by RepliG® had 
higher sequence diversity than those amplified with RP-SISPA, 
with rarefaction curves not saturating after 300,000 randomly 
sampled sequences. These results indicate that biases in coral viral 
metagenomes are caused by the genome amplification method, as 
well as other steps in viral nucleic acid isolation. 



Table 1 | Percentage of viruses Identified in DNA viromes by BLAST 
comparison to the NCBI viral Refseq genome database (bit-score = 
50, GAAS normalized). 



Virus type 


CFM 




MECH 








NLN 


LN2 


NLN* 


LN2* 


dsDNA viruses, no RNA stage 


11.3 


10.9 


6.6 


78.9 


71.9 


Bacteriophage 


(98.6%) 


(93.5%) 


(93.5%) 


(99%) 


(99%) 


Eukaryotic 


(1.4%) 


(6.5%) 


(6.5%) 


(1%) 


(1 %) 


Reverse transcribing viruses 


0 


1.8 


4.9 


0.4 


1.0 


Satellites 


3.3 


4.7 


6.1 


0.9 


0.1 


ssDNA viruses 


84.9 


82.1 


82.3 


175 


24.6 


ssRNA viruses 


0 


<0.1 


<0.1 


<0.1 


<0.1 


Unclassified archaeal viruses 


0 


<0.1 


0 


0 


<0.1 


Unclassified phages 


0.4 


0.4 


0.1 


2.2 


2.3 


Unclassified viruses 


<0.1 


0.1 


<0.1 


<0.1 


0.1 


Virophage 


<0.1 


0 


0 


<0.1 


<0.1 



The viral metagenomic data sets generated by the previously published chlo- 
roform treatment method (CFM) and our novel mechanical disruption method 
(MECH). The latter method was tested with (LN2) and without (NLN) liquid nitro- 
gen tissue preservation. Data sets with an asterisk (') were amplified using 
FIP-SISPA pnor to sequencing and were analyzed after removal of contaminating 
ssDNA phiX sequences (spil<ed during lllumina sequencing). All other data sets 
were amplified using RepliG'^ technology. Percent of dsDNA viruses as phage 
or eukaryotic viruses are highlighted in brackets. 
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FIGURE 3 I Annotation of Pocillopora damicornis DNA virome 
metagenomic sequences with a match to viral sequences for each 
method and treatment. Best matches to major viral groups were examined 
through Metavir, which uses tBLASTx algorithm against the NCBI viral 
Refseq genomes with a bit-score of 50 (data sets are publically available at 



http://metavir-meb.univ-bpclermont.fr, project "Coral virus — generating 
metagenomes"). Total numbers (per cent) of sequences for each viral group 
are indicated in Table 1. All samples were amplified using a RepliG® kit 
except for samples NLN* and LN2', which underwent RP-SISPA 
amplification. 



We employed the MECH method for both DNA and RNA 
viromes in five other coral species and present viral diversity data 
here for one of these, Acropora tenuis. The samples were col- 
lected at a remote field location and tissue homogenates were 
stored in liquid nitrogen prior to MECH. We amplified the DNA 
and RNA-virus fractions using the modified RP-SISPA method 
described here and an existing RNA RP-SISPA protocol (CuUey 
et al, 2010), respectively. The diversity data for DNA viruses gen- 
erated using our method, revealed that 84% of the sequences 
with hits were for dsDNA viruses (Figure 5A). The RNA viral 
sequences generated from A. tenuis were ~85.6% from ssRNA 
viruses (Figure 5B), with the majority of the identified hits 
(~99%) matching a major capsid protein (MCP) gene from the 
dinoflagellate-infecting ssRNA virus, Heterocapsa circularisquama 
RNA virus (HcRNAV) (bit-score = 50). This result supports 
the observation of sequence reads with similarities to HcRNAV 
previously reported in a Montastrea cavernosa study (Correa et al., 
2013). 



DISCUSSION 

We have developed and optimized a method for the extrac- 
tion and preparation of viral DNA and RNA from coral tissues 
for metagenome sequencing. Metagenomics can be a powerful 
tool for describing biological diversity in a sample without prior 
information on the nature of the sample. However, the bio- 
logical relevance and accuracy of metagenomic sequence data 
can be reflective of how nucleic acids (i.e., the metagenome) 
were obtained. We show that the methods used in viral nucleic 
acids preparation can introduce severe biases. The method pre- 
sented here minimizes biases and captures a larger fraction of the 
viral diversity in coral samples compared to previously published 
methods. 

Previously described methods for isolating viral nucleic acids 
from coral tissue have included a chloroform step in order to 
disrupt cellular material (Marhaver et al., 2008; Thurber et al., 
2008; Hewson et al., 2012). Certain virus types are sensitive 
to chloroform and other organic solvents, which denature the 



www.frontiersln.org 



May 2014 | Volume 5 1 Article 206 | 7 



Weynberg et al. 



Virus metagenomes from reef corals 



0.01 



CM 

in 



0.00 



LN2 SISPA 




















I 


□ 

NLN 


SISPA 
























































CFM RepliG 
A 


















































ln; 


RepliG 
















NLN RepliG 


















-0.01 0.00 


0.01 0.02 
MDS axis 1 


0.03 



140000 



O 120000 



100000 



8" 

in 
o 

3 

'E 



o 
n 
E 



80000 



20000 




50000 



100000 150000 200000 

Number of sequences sampled 



250000 



300000 



FIGURE 4 I Diversity analysis of the Pocillopora damicornis DNA 
virome data sets. Each data set was generated from a uniform tissue 
homogenate using either a chloroform-based method (CFM) or a 
mechanical bead-beating method (MECH) to disrupt cells. The latter 
method included either an additional liquid nitrogen storage step (LN2) or 
samples were processed immediately from fresh tissue (NLN). The 



nucleic acid amplification method used for each data set is indicated 
(RepliG® or RP-SISPA amplification). (A) tetranucleotide cluster analysis 
was used to demonstrate how the methods change nucleotide frequency 
(proxy for viral community diversity) (B) rarefaction analysis was used to 
illustrate the impact of methodology on perceived diversity within a viral 
metagenome. 



lipid envelopes surrounding virus capsids (Feldman and Wang, 
1961; Ackermann, 2006), and internal lipid membranes (Bamford 
et al, 1995), which are also a common feature of phycoviruses 
(Dunigan et al, 2006; Wilson et al, 2009) and other nucleo- 
cytoplasmic large DNA (NCLDV) viruses. Furthermore, viruses 
that do not possess a lipid envelope, such as ssDNA viruses 
belonging to the Inoviridae, and about a third of tailed bacterio- 
phages (Ackermann, 2006) also display sensitivity to chloroform. 
Disruption of the lipid envelope surrounding a virus is typically 
detected by a decrease in infectivity, but it also changes the physi- 
cal attributes of the virus including its size and buoyancy. Changes 
in the physical characteristics of a virus will directly affect out- 
comes of isolation and can lead to the exclusion of certain 
members of the viral community from sample preparations. We 
postulated that in order to capture as many representative virus 



groups as possible, chloroform should be avoided. To test this, a 
previously reported method (Marhaver et al., 2008; Thurber et al, 
2008, 2009) incorporating a chloroform addition step (CFM) 
was compared to the mechanical homogenization bead-beating 
(MECH) approach described here. 

Notably, there is a loss of sequence data for reverse- 
transcribing viruses in the chloroform-treated samples, likely 
because reverse-transcribing viruses have a lipid envelope. This 
is also a common feature of viruses in several other families (e.g., 
Herpesviridae, Poxviridae) (King et al, 2011). Therefore, chloro- 
form addition during the purification of viruses from coral tissue 
will affect the types of viruses present in the metagenome and 
results in lower diversity compared to the samples treated with 
the MECH approach. In addition, the CsCl step-gradient cen- 
trifugation described in our method maximizes the capture of a 
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FIGURE 5 1 Annotation of the Acropora tenuis (A) DNA and (B) RNA 
virome metagenomic sequences with a match to viral sequences for 
the IVIECH method following storage in liquid nitrogen and amplified 
by RP-SISPA. Best matches to the major viral groups were examined 
through Metavir, which uses the tBI_ASTx algorithm against the NCBI viral 
Refseq genomes with a bit-score of 50. 



wider diversity of viruses because it includes density and nucleic 
acid quantification for each fraction removed along the gradient 
to ensure that the appropriate fractions are targeted. 

Full processing of coral tissue after collection is often delayed, 
necessitating the use of liquid nitrogen for preservation. Our 
results demonstrate that storage of coral-tissue homogenate in 
liquid nitrogen prior to purifying viruses and extracting nucleic 
acids can result in lower starting yields of nucleic acids but 
does not significantly alter the representative sequences in the 
metagenomic data. 

Amplification methods cause artifacts and biases. We rec- 
ommend the use of the RP-SISPA methods presented here for 
amplifying viral DNA and RNA isolated from coral tissue using 
mechanical disruption. RP-SISPA has been used previously to 
detect and identify viruses through a metagenomic approach 



(CuUey et al, 2010; Karlsson et al., 2013), and has an advantage 
over MDA in that branching does not occur during amplification, 
the formation of chimeras is reduced and ssDNA is not preferen- 
tially amplified (Poison et al, 2011). However, RP-SISPA is not 
without its own issues and has been reported to show a template- 
dependent amplification bias, resulting in uneven sequencing 
depth within and among genomes (Victoria et al., 2009, 2010; 
Rosseel et al., 2013). RP-SISPA can have a bias for amplify- 
ing the dominant sequences in metagenomic samples (Karlsson 
et al., 2013), likely the reason that most of the sequences in 
the RP-SISPA metagenomes contain sequences from the dsDNA 
viruses, which have larger genomes relative to other viral types. In 
addition, PGR amplification is known to produce amplification 
artifacts, such as increased heteroduplex formation in samples 
with increased diversity (Qiu et al, 2001). Because of this, we 
have included a reconditioning PGR approach, where replicate 
amplification PGR reactions are pooled and then "reconditioned" 
through a low number of additional cycles in order to minimize 
heteroduplex formation (Thompson et al., 2002) and elimi- 
nate potential amplification biases within a single PGR reaction. 
Although our data indicate that RP-SISPA is preferred over MDA, 
caution must be exercised in order to avoid over-interpreting 
metagenomic data generated following amplification, especially 
regarding relative abundances of viral groups. 

The poor representation of many viral taxa in public databases 
limits the ability to interpret the results from metagenomic 
sequence data. For example, low levels of archaeal viruses in 
our coral viromes may result from the lack of representative 
sequences in the public databases. In all samples, the majority of 
dsDNA hits were to bacteriophages, which mirrors the findings 
in other metagenomic studies of viruses in reef corals (Marhaver 
et al., 2008; Thurber et al, 2008). It is notable that in the RNA 
metagenome data for A. tenuis there was a high proportion of hits 
matching a MGP gene found in a ssRNA algal virus (HcRNAV) 
that infects dinoflagellates (Tomaru et al., 2004). This indicates 
the potential presence of a virus associated with the dinoflag- 
ellate endosymbiont, Symbiodinium, in the coral holobiont and 
supports the observation of sequence reads with similarities to 
HcRNAV reported in a previous study of coral-associated viruses 
(Gorrea et al., 2013). As viral genomes can be DNA or RNA, 
double- or single-stranded (King et al., 20 11 ), coral viromic stud- 
ies should try to extract both RNA and DNA viruses in an attempt 
to capture as much of the virus community as possible. 

CONCLUSION 

We developed a method to generate viral metagenomes from the 
coral holobiont. This new method uses a mechanical approach 
to disrupt host cells and avoids the use of chemicals, such as 
chloroform, which we demonstrate can reduce the quantity and 
diversity of the viruses targeted. We also note that caution must be 
exercised when drawing inferences from viral metagenome data 
generated using currently available amplification methods, due 
to possible introduced biases in the processing prior to sequenc- 
ing. The method presented here has been tested on coral species 
from different taxonomic families and performs well in uncov- 
ering a wider diversity of the viromes present in coral tissue 
samples. 
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